RNA sequencing data: biases and normalization
نویسندگان
چکیده
Motivations In recent years, RNA sequencing (RNA-seq) has rapidly become the method of choice for measuring and comparing gene transcription levels. Despite its wide application, it is now clear that this methodology is not free from biases and that a careful normalization procedure is the basis for a correct data interpretation. The most common normalization techniques account for: library size, gene or transcript length and sequence-specific biases such as GC-content effects. The aim of the present work is to investigate biases affecting RNA seq data and their effect on differential expression analysis. In order to reduce biases due to over-simplification of gene transcription models, we consider exon-based counts.
منابع مشابه
Using normalization to resolve RNA-Seq biases caused by amplification from minimal input.
RNA-Seq has become a widely used method to study transcriptomes, and it is now possible to perform RNA-Seq on almost any sample. Nevertheless, samples obtained from small cell populations are particularly challenging, as biases associated with low amounts of input RNA can have strong and detrimental effects on downstream analyses. Here we compare different methods to normalize RNA-Seq data obta...
متن کاملExperimental design, preprocessing, normalization and differential expression analysis of small RNA sequencing experiments
Prior to the advent of new, deep sequencing methods, small RNA (sRNA) discovery was dependent on Sanger sequencing, which was time-consuming and limited knowledge to only the most abundant sRNA. The innovation of large-scale, next-generation sequencing has exponentially increased knowledge of the biology, diversity and abundance of sRNA populations. In this review, we discuss issues involved in...
متن کاملPackage ‘ scran ’ April 15 , 2017
April 15, 2017 Version 1.2.2 Date 2017-01-18 Title Methods for Single-Cell RNA-Seq Data Analysis Maintainer Aaron Lun Depends R (>= 3.3.0), BiocParallel, scater Imports dynamicTreeCut, zoo, edgeR, stats, BiocGenerics, methods, Biobase, utils, Matrix, shiny, graphics, grDevices, statmod Suggests limSolve, limma, testthat, knitr, BiocStyle, org.Mm.eg.db, DESeq2, monocle, S4Vect...
متن کاملPackage ‘ scran ’ January 15 , 2017
January 15, 2017 Version 1.2.1 Date 2017-01-11 Title Methods for Single-Cell RNA-Seq Data Analysis Maintainer Aaron Lun Depends R (>= 3.3.0), BiocParallel, scater Imports dynamicTreeCut, zoo, edgeR, stats, BiocGenerics, methods, Biobase, utils, Matrix, shiny, graphics, grDevices, statmod Suggests limSolve, limma, testthat, knitr, BiocStyle, org.Mm.eg.db, DESeq2, monocle, S4Ve...
متن کاملPackage ‘ scran ’ January 31 , 2017
January 31, 2017 Version 1.2.2 Date 2017-01-18 Title Methods for Single-Cell RNA-Seq Data Analysis Maintainer Aaron Lun Depends R (>= 3.3.0), BiocParallel, scater Imports dynamicTreeCut, zoo, edgeR, stats, BiocGenerics, methods, Biobase, utils, Matrix, shiny, graphics, grDevices, statmod Suggests limSolve, limma, testthat, knitr, BiocStyle, org.Mm.eg.db, DESeq2, monocle, S4Ve...
متن کامل